Long Span Features and Minimum Phoneme Error Heteroscedastic Linear Discriminant Analysis
نویسندگان
چکیده
In this paper we explore the effect of long-span features, resulting from concatenating multiple speech frames and projecting the resulting vector onto a subspace using Linear Discriminant Analysis (LDA) techniques. We show that LDA is not always effective in selecting the optimal combination of long-span features, and introduce a discriminative feature analysis method that seeks to minimize phoneme errors on training lattices. This technique, referred to as Minimum Phoneme Error Heteroscedastic Linear Discriminant Analysis (MPE-HLDA), is shown to be more robust than LDA when applied to long-span features and easy to incorporate with existing training procedures, such as HLDA-SAT and discriminative training of Hidden Markov Models (HMMs). Results on conversational telephone speech and broadcast news corpora also show that the recognition accuracy is improved using features selected by MPE-HLDA.
منابع مشابه
Optimization of class weights for LDA feature transformations
One popular feature type in speech recognition is based on linear transformations of sequences of cepstral feature vectors. In general the transformation is generated in two steps: first a transformation like linear discriminant analysis (LDA) or heteroscedastic linear discriminant analysis (HLDA) is used to maximize separation between classes and reduce the dimensionality, followed by a decorr...
متن کاملDiscriminant spectrotemporal features for phoneme recognition
We propose discriminant methods for deriving twodimensional spectrotemporal features for phoneme recognition that are estimated to maximize the separation between the representations of phoneme classes. The linearity of the filters results in their intuitive interpretation enabling us to investigate the working principles of the system and to improve its performance by locating the sources of e...
متن کاملReview on Heteroscedastic Discriminant Analysis
Discriminant feature spaces are attractive way to improve the word error rate performance of the speech recognition systems. Heteroscedastic discriminant analysis (HDA) is a generalized method for the feature space transformation that does not impose the equa l w i th in c l a s s cova r i ance assumptions required by the standard linear discriminant analysis (LDA). It will be shown that the co...
متن کاملHeteroscedastic linear feature extraction based on sufficiency conditions
Classification of high-dimensional data typically requires extraction of discriminant features. This paper proposes a linear feature extractor, called whitened linear sufficient statistic (WLSS), which is based on the sufficiency conditions for heteroscedastic Gaussian distributions. WLSS approximates, in the least squares sense, an operator providing a sufficient statistic. The proposed method...
متن کاملRecent Advances in Broadcast News Transcription
This paper describes recent advances in the CU-HTK Broadcast News English (BN-E) transcription system and its performance in the DARPA/NIST Rich Transcription 2003 Speech-to-Text (RT03) evaluation. Heteroscedastic linear discriminant analysis (HLDA) and discriminative training, which were previously developed in the context of the recognition of conversational telephone speech, have been succes...
متن کامل